ON PERMANENTS OF RANDOM MATRICES WITH POSITIVE 

ELEMENTS 



TONCI ANTUNOVIC 

Abstract. We study the asymptotic behavior of permanents of n x n random matrices A 
with independent identically distributed positive entries and prove a strong law of large num- 
bers for log per A. We calculate the values of the limit Yrnin^ao under the assumption 
that elements have power law decaying tails, and observe a first order phase transition in 
the limit as the mean becomes infinite. The methods extend to a wide class of rectangular 
matrices, ft is also shown that in finite mean regime the limiting behavior holds uniformly 
over all submatrices of linear size. 



1. Introduction 

The permanent of an m x n matrix A (height m and width n) satisfying m < n is defined 

as 

n 

per A= ^2 II^X*)' 

where S mn is the set of one-to-one functions from [m] = {1, . . . , m} to [n] = {1, . . . , n}. When 
m = n, that is when A is a square matrix, S mn = S n the set of permutations on [n\. 

In this paper we will study asymptotics of permanents of large matrices with positive, 
independent and identically distributed elements. Permanents of random matrices of similar 
type have been studied in a number of papers. 

In [2] and [3] Girko proved that 

log I per A\ — E log I per A\ 
hm > 0, 

(without estimating Elog|perj4|) for nx n square matrices A with independent elements, 
when either characteristic functions or Laplace transforms of elements is of the form exp(— c|t| a ) 
(and some other finite man cases). Working in the context of perfect matchings on random 
bipartite graphs, Janson [B] proved central limit theorems for permanents of matrices with 
0-1 iid elements. In a series of papers Rempala and Wesolowski studied the permanents of 
large rectangular matrices with identically distributed elements of non-zero mean and finite 
variance (allowing some correlation among elements in each column). In the case of iid el- 
ements, relying on earlier results of van Es and Helmers [1] and Borovskikh and Korolyuk 
[7j, they proved central limit theorems [lOj for (per A)/M per A and later certain strong laws 
of large numbers [TT]. See also Chapter 3 in |12j for a self-contained discussion of these re- 
sults. Recently Tao and Vu [13J obtained significantly different behavior for nxn matrices A 
with independent mean zero Bernoulli ±1 elements. They showed that with high probability 
\per(A)\ = n ^ +0 ^ n . 
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The above results demonstrate the contrast between the non-zero mean, finite variance 
case and the Bernoulli case, which can be summarized as 

, . j. log | per A\ f 1, in the case of the finite variance and non zero mean, 
m,n->oo mlogn \ \-, in the Bernoulli case with zero mean for m = n. 

In the non-zero finite mean case (u being mean of the elements) the value of the limit, and 
especially the upper bounds, can be inferred by calculating the first moment E(per^4) = 
and in the Bernoulli case from the second moment E(| per ^4| 2 ) = n\. In this paper 
we will calculate the value of this limit under the assumption that elements are positive and 
have power law decaying tails P(£ > t) = £~ V/'+oC 1 ) . In the case (3 > 1 elements have infinite 
mean which prevents us from guessing the value of the limit. Actually in Theorem [T] we will 
observe a first order phase transition in the limit at j3 = 1, when the mean becomes infinite. 



2. Setup and the Result 

In the text we will assume that for m < n, A mn is an m x n matrix {A n when m = n) with 
independent positive elements distributed as £. Note that we will drop the subscripts when 
there is no confusion. Assuming that the matrices are constructed on a common probability 
space, theorems below give strong laws of large numbers for lo ^^^" (in particular they 
imply a weak law of large numbers without the assumption that A n are given on a common 
probability space) . Extensions to rectangular matrices are given in Section [5j 

Theorem 1. Let £ be a positive random variable satisfying 
(2) lim g 8 = -I 

V ' f^oo log* f3 

for some /3 > 0. If {A n ) n is a sequence n x n matrices on a common probability space with 
elements which are independent and identically distributed as £, then almost surely 

fQ\ v lo gperA n 

IS) lim =max(l,p). 

n-»oc n log n 

Random variable ^ in ([2]) has finite variance for j3 < 1/2, finite mean for f3 < 1, and infinite 
mean for f3 > 1 when we observe a limit different from the values in ([T]). 

The following result generalizes the case f3 < 1. It does not require the finite variance 
assumption and gives the general lower bounds and the upper bounds in the case of finite 
mean uniformly over all submatrices of linear size. Note that for an m x n matrix A = (a^) 
any matrix B = (ay)igj j^j, where / C [m], J C [n] is called a submatrix of of A n . 

Theorem 2. Assume that (A n ) n is a sequence ofnxn matrices on a common proba- 
bility space with elements which are independent and identically distributed as £ ; and 
let < a < 1. 
i) We have 

(a\ v ■ r ■ lp g P er B ^ 1 
(4) iiminf mm > 1. 

rwoo (k,B) k log k 

where the minimum is taken over all integers an < k < n and all k x k submatrices 
B of A n . 
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ii) If £ has a finite mean then 

/r\ v log per B 

(5) lim sup max — - — — = 1, 

„^oo (k,B) klogk 

where the maximum is again taken over all integers an < k < n and all k x k 
submatrices B of A n . 

Condition ([2]) in Theorem[T]is satisfied with j3 > 1 for many common heavy tail distributions 
including Pareto distribution, Levy distribution, Inverse-Gamma distribution, Beta-prime 
distribution and many more. We are particularly interested in the case when £ has Pareto 
distribution with parameter /3, that is P(£ > t) = t~ 1///3 for t > 1. Actually in Section [3] the 
upper bounds in Theorem [1] will be proven in the Pareto case and then extended to the general 
case via simple stochastic domination. Note that when the convergence ([2]) fails to hold, one 
cannot guarantee the existence of the limit in ([3|) (see Example 1161 in Section [5]). However, 
the upper bound on the lim sup in ([2]) will imply the upper bound in ([3]) and similarly the 
lower bound for lim inf. 

Remark 3. From a more combinatorial point of view permanents can be interpreted in the 
context of saturated matchings (or perfect matchings for m = n) of bipartite graphs. For a 
bipartite graph G = {V,E) let V = V\ U V2, |Vi| < | V2 1 be a decomposition of the vertex sets 
into subsets so that no two vertices in Vi are connected by an edge. Saturated matchings 
of G can be defined as subsets M. C E of the edge set with the property that every vertex 
is adjacent to at most one edge in A4 and that every vertex in the smaller component V\ is 
adjacent to at least one edge in M. For m < n and an m x n matrix A = (ay) containing only 
elements and 1, construct a bipartite graph G with m + n vertices {v±, . . . v m ,wi, . . . w n } 
so that Vi and Wj are connected by an edge if and only if aij = 1. Clearly every one-to- 
one function ir : [m] — > [n] for which YYl = i a iir(i) 1S non- vanishing corresponds to a saturated 
matching on G in 1 — 1 manner. Therefore per A is equal to the number of saturated matchings 
on G. For general matrices A one can construct the graph by drawing an edge between Vi and 
Wj whenever a%j 7^ and putting the weight on this edge. Then per A can be interpreted 
as the total weight of all the saturated matchings on G (a weight of a matching being the 
product of the weight on its edges). All the results in this paper can be interpreted in this 
way. 

In the following section we prove the upper bounds in Theorem [T] and in Section H] we 
provide the lower bounds and prove Theorem [2j In the last section we will extend the results 
to a large class of rectangular matrices and show an example demonstrating that, in general 
without ([2]) Theorem [T] fails to hold. 

3. Proof of the upper bounds in Theorem CD 

The exact calculations needed for the proof of the upper bounds in Theorem [T] are easier 
to perform when we are given a concrete distribution of £ to work with. The proof will be 
provided for the Pareto case, but first we will see how this yields the upper bounds in Theorem 
Q] for the general case. 

Remark 4. Throughout the paper we will use the following two simple observations. 

i) For any m x n matrix A and Aelwe have that per(Aj4) = A m per A Thus the value 
of the limit of log per A n /(n log n) in Theorem [Tj (as well as liminf and lim sup) is unchanged 
if we replace the generic random variable £ by random variables A£, for any A > 0. 
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ii) Assume that we are given two random variables £1 and £2 with right continuous cumu- 
lative distribution functions F\(t) and (and denote Ff(t) = lim s ^Fj(s)). If £2 stochas- 
tically dominates £1, that is F 2 \t) < Fi(t) for any t > (equivalently P(fi > t)< P(£ 2 > *)), 
then one can enlarge the probability space 0, that supports £1 and construct a version of £2 
on the larger probability space which dominates £1 pointwise. For example, if £1 is defined 
on (fi,P) then on (O x [0, 1],P x dX), where dX is the Lebesgue measure on [0, 1], we define 
u = AFi(£i) + (1 — X)F^~(^i). It is easy to check that u is uniformly distributed on [0,1]. 
Knowing this it is also easy to check that £ 2 = ^{{t : > u } has the same distribution as 
£2 and that F^(^i) < u < ^(£2)- The last inequalities imply that £1 < £ 2 everywhere. Thus 
if (Al^ ) is a sequence ofmxn matrices with independent elements distributed as £1 defined 
on a common probability space, one can enlarge the probability space and construct on it 
a sequence ofmxu matrices (A^ n ) whose elements are independent and distributed as £2 
such that for every n, the ijth element of n is not larger than the corresponding element 
of A^ nn . In particular, if £1 and £2 are almost surely positive then per^4^ n < per^4^ n . The 
analogous claim holds when £1 dominates £2. 

Proof of the upper bounds in Theorem^ assuming it holds for the Pareto case. Fix e > and 
take M > 1 so that P(£ > t) < t-VC/He) holds for all t > M. Denote by 1 p+e a Pareto dis- 
tributed random variable with parameter /3 + e and observe that P(£ > t) < P(M£ jS+(E > i) 
holds for all t. Assuming the statement holds for the Pareto case, Remark 0] implies that 
almost surely 

log per A n 

hm sup < p + e. 

n n log n 

Since e > was arbitrary the claim follows. □ 

The rest of this section is devoted to the proof of the upper bounds in the Pareto case in 
which we show explicit calculations. A useful observation which we will use extensively is the 
fact that if £ is a Pareto distributed random variable with parameter /3, then Y = (log£)//3 
has exponential distribution with rate 1, that is F(Y > t) = e~* for t > 0. We start by 
proving some basic estimates for maxima of independent exponential random variables. 

Lemma 5. Let n > 2 and Yi, 1 < i < n be independent exponential random variables with 
rate 1 and R = max j<^ Y \ 

log n 

i) For any positive t 

(6) F(R < t) < exp(-n 1- *), and ¥(R > t) < n 1 ^. 

ii) The expectation of e R can be bounded as 

1 

log n — 1 

Proof, i) Both inequalities are straightforward. First we calculate 

(8) F(R <t) = P(max Y { < tlogn) = (l-e- tlogn Y = (l -n" 4 )™. 

l<i<n V J 

Now applying the inequality 1 — x < e~ x to the right hand side of ([8]) we obtain the first 
inequality in ([6]). Using ([8]) and the inequality (1 — x) n > 1 — nx which holds for all positive 
integers n and all < x < 1 we prove the second inequality in ([6]): 

F(R > t) = 1 - (1 - n - *)" < n l -\ 



(7) E(e R ) <exp( 1 + 
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ii) Using the second inequality in ([6]) we obtain for t > 1 

p (e R > t) = F(R > logt) < -^—r = -y— — , 

V — / v — e> / — ^log t £log n ' 

from where we get 
E (e R ) = F(e R >t)dt<e + -^-dt = e + — ^ =-U r 

K J Jo V - / - ^ t logn logn _ x e logn-l 

r ) <expfl + - 3—-) 

n — 1 / \ log n — lj 



e( 1 + . 

log n 



□ 



The idea of the proof of the upper bounds in the Pareto case is to estimate (by evaluating 
the expectation) the number of permutations it for which the product YYi=i w ^ n ^ e m 
some given interval. The key estimate is provided in Lemma [7J We will only consider the 
intervals not exceeding (ny/\og n) l3n , since as the following lemma shows, the largest product 
Eli £wr(i) typically does not exceed this value. 

Lemma 6. If are independent exponential random variables with rate 1 then for any 

A > 

co / n 1 1 \ 

(9) m^^y vW >nlogn + n^ < oo. 

?1=2 \ ™ 8=1 / 

Proof. Denote Ri := maxi 1 ^^ t " — . From the definition of R{ it is obvious that 

n / n \ 

^ e6 " i=l \i=l / 

Using the inequality (J7|) we have 

™ / , ^ , log log n \ TO / „ log log n \ 

F max V Y iMi) > n log n + n g-g < P V > n + n ° 



<E(e^ Ri ~ n ~ n 



log log ?i 
A log n 



E(e R i) Y // 1 log log \ 

< exp — In. 

\ \ log n — 1 A log n / / 



\e(logn) A1 °s n 

The right hand side above is summable in n which proves the lemma. □ 
Lemma 7. Let (Xi,j)i,j ^ e independent exponential random variables with rate 1 and 



I 7r G S n : (k - l)n < ^2 Y i,Tr(i) < kn \ 
^ i=i ' 



(10) Z n , k 
Then for any 7 > 1 and A > 1 we have 

(11) X>( |J {Z n , k > E(^, fc ) 7 }) < 00. 

n=2 l<fc<logn+ logl ° B " 
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Proof. First by Markov's inequality 



( \j {z n , k > E(z n , fe ) 7 }) < p ( z «,fc >nz n ,kV) 

V-. — » ^ n Incrlntrn ' ^ n l.Oglog/1 

A 

1 



•l<fc<logn+ ' ogl A og " l<fc<logn+ '° sl A os " 



s E 



1 7 1 i loe log n 

l<k<\ogn-\ — a 



E(Z n , fc )7-i- 

Let's calculate the expectation of Zn,fe- Clearly for any fixed it £ S n we have 

E(Z n , fe ) = n\F ({k -l)n<J2 < kn^j . 

^ i=l ' 

Since for a fixed it, Ya=i ^,7r(i) is the sum of n independent exponential random variables 

with mean 1, it has Gamma density x ™ n tiy X ■ Therefore the expectation of Z n k is given by 

rkn -n—lp—x rkn 

(12) E(Z n>k ) = n! / — dx = n x n ~ x e~ x dx. 

J(k-l)n [ n - J-J 1 J(k-l)n 

In particular we have 

rkn 

(13) E(Z n , fc ) < e - {k ~ 1)n / nx n ~ x dx < (kn) n e~ {k ~ l)n . 

J{k-l)n 

Furthermore by (j 1 2[) 

rkn rkn 

(14) E(Z n>fc ) = n \ x n - l e~ x dx > e~ kn / nx n ^dx = e~ kn n n (k n - (k - l) n ). 

In particular E(Z ni i) > (f)™, which implies that the series Yln°=i ^(-^™,i) 1-7 converges to a 
finite limit. Therefore we are left to prove 



(«> E E 



n=2 2<fc<logn+ losl ° gn 



E(^ n , fc )^- 1 



< oo. 



Again using (|14p and the inequality k n > (k — l) n + n(& — l) n 1 we obtain 

E(Z njfc ) > e" fcn n n+1 (A; - l) n_1 > e- kn n n (k - l) n , 

from where 

1 e «(7-l) gfen(7-l) 

( - 16 ' ) ^ E(Z n fc)^ 1 - n™^- 1 ) ^ fc"(7-i) ' 

2<fc<logn+ logl A ogn ' l<fc<logn+ logl A ogn 

The function = e tn ^' y ~ 1 H~ n ^ / '~ 1 ' is convex since 

e tn( 7 -i) 



g"(t) = n ( 7 - l) t „ (7 _ 1)+2 (n( 7 - l)(t - 1)' + 1) > 0. 



Therefore for any 1 < k < log n + log ^ g - we have 
(!7) r^7— n = g(fc) < max^ g(l),g log re + 
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For any n large enough we have 

(. loglogn\ ^^-^(logn) 2 ^ / n \ 71(7-1) 

and, for such n, using flTTJ) we also get 

e *n(7-i) / log log n\ n^T-^Qogn) 2 ^ n ™(7-i) 

fc"(7-l) - 5 V + ^ J ~ (logn+ S^M " (l og „)n(7-l)(l-l/A)- 

Thus, for n large enough (|16j) yields 

1 e «(7-i) n »(7-i) / bg log n 

, E(Z nfc )7-l " n«(7-l) (log„)n(7-l)(l-l/A) " + A 

2<fc<logn+iH^i 



"■(7 i) / log log n 

log n + 



(logn^-VAy V A 

The expression on the right hand side is summable in n which proves (|15p and thus also 

(HID- □ 

Now we are ready to finish the proof of upper bounds. 

Proof of the upper bounds in Theorem^ for the Pareto case. Define Z n & as in (jlOp and fix an 
arbitrary 7 > 1. Lemmas [6] and [7] and Borel-Cantelli lemma imply that almost surely there 
exists a positive integer no such that for all n > no we have 



^ nloglogn 

max 2 ^i,7r(i) < n l°g n H ^ ano - 

i=l 

log log n 



■Zn.fe < E(Z„ jfc ) 7 , for each 1 < k < logn + 
Using f)13|) the following inequalities are almost surely satisfied for n large enough 
(18) perA= 5^ e^^*=i y «.»W < ^ e^ fcn Z„ ifc < ^ e^E^, 

n £Sn l<fc<logn+ logl ° gn l<fc<logn+ logl ° g7 ' 

< e 7n n 7 ™ ^ }fin e ifl-i)kn 

l<fc<logn+ logl ° g " 

log log n^ /_ 7 „ (i3 - 7 ) T „ 



< e 7 "n 7n log n + & & max r 7n e 

V 2 / l<r<logn+ logl ° g " 

If /3 > 1 and 7 is such that /3 > 7 > 1, r 7n e^" 7 ^ rn is an increasing function in r and thus for 
n large enough 

^ -vn 5nn \ (g-7)w A , loglogn\ 7 ™ +1 
per yl < e 7 n p (log n) 2 MograH j 

This yields 

log log n 



log per A < 7 /3- 7 loglogn + / + 1\ log (tog n + — T 



n log n log n 2 log n \ n J log n 
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from where clearly 

log per A 
limsup — < p. 

n^oo n log n 

In the case /3 < 1 we want to maximize the function r 7n e^ _7 ^ rn . Write e h ^ := r 7n e^ _7 ^ Tn . 
We get 

h(r) = 7 n log r + (/3- 7 )rn, ti(r) = ^ + Q3 - 7 )n, &"(t) = < 0. 
Therefore function /i is concave and the maximum occurs when h'(r) = 0, that is when 
t = — at which the value of the function e h ^ is equal to ( jz^rz ) ■ From f[T8|l we get 



VVXXXV.XX .n. .«u, wx ,xx c xxxxx^xwxx o xo ^ - \ (7 — /^) e 

log log n\ / 7 x 



and so 



per A < n 7 ™ log n + , , 

V 2 / V7-/3 

log per ^ lQ g ( lo S n + l5I ^ £IL ) i g 7 _ io g ( 7 - /3) 

<7H ; - + 1 -■, > 7> 



nlogn nlogn logn 

as n — > oo. Since 7 > 1 was arbitrary the claim follows. □ 

4. Lower bounds and the proof of Theorem [2] 

In this section we prove Theorem [2] as well as the lower bounds in Theorem [TJ An important 
ingredient is the use of stochastic domination to reduce certain technical issues to iid 0, 1 
matrices. The following result proven by Hall [4] and Mann and Ryser in [8] provides lower 
bounds for permanents of such matrices (see also Theorem 1.2 in Chapter 4 of [9]). 

Proposition 8. Let A be an m x n matrix, m < n whose all elements are equal to or 1. 
Assume that each row of A contains at least k elements equal to 1. Ifk> m, then 

(19) perA> k - 

(k — my. 

If k < m and per A > then 

(20) per,4>/c!. 

As discussed in the introduction (see Remark[3]) permanents of matrices with 0, 1 elements 
can be viewed as the number of saturated matchings on corresponding bipartite graphs. To 
ensure the positivity of the permanent, when applying (|20p . we will exploit this connection 
through the classical Hall's marriage theorem, which can be easily stated in this setting (see 
0). 

Theorem 9. Let G = (V, E) be a bipartite graph and let V = V\ U V2 be a decomposition 
of the vertex set so that no two vertices in Vi are connected by an edge, i = 1,2. Assuming 
\V\ I < 1^2 1; there exists a saturated matching on G if and only if for any subset W C V\ we 
have \W\ < \{v : v ~ w, w G W}\. 

Restating the above theorem in terms of permanents of 0, 1 matrices yields the following 
lemma. 

Lemma 10. Let B be an m x n matrix whose all elements are either or 1. If for any 
1 < k < m any k x (n — k + 1) submatrix of B has at least one element equal to 1, then 
per B > 1. 
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All the necessary applications of Proposition [8] and Lemma [10] are summarized in Lemma 
[12] which, in particular, proves the lower bounds in Theorem [2] 

Remark 11. Recall that Stirling's formula says that 

lim n\e n n-( n+1 'V = v 7 ^. 

n— >oo 

In particular there are constants c\ < c 2 so that for any n and 1 < k < n — 1 

n n+l/2 / X n n+l/2 

21 c l7 - -— ———< \<c 2 - 



fcfc+l/2( n _ fc)»-fc+l/2 ~ \kj ~ k k+1 / 2 (n - k) n ~ k+1 / 2 ' 

Lemma 12. Let £ be a positive random variable and let A n be a sequence of n x n matrices 
whose elements are independent and identically distributed as £. For any < a < 1 and any 
5 > £/iere exists r > wt/i i/ie following property: Almost surely there exists no suc/i i/iai 
for any n > no cmd any an < k <n, any kxk submatrix B of A n satisfies per B > r k k^^^ k . 

Proof. Let g > be such that P(£ < o) < rj, where r\ is to be chosen later. Define the random 
variable £ = l^ >q y and define the matrix A n = Let 93 n denote the event that some 

row of A n contains more than a6n zeros and let (£ n denote the event that for some k\ and 
k 2 satisfying an < k± + k 2 there exists a fci x k 2 submatrix of A n containing only zeros. 
By Lemma [10] on the event £^ an y kxk submatrix of A n has a positive permanent, for 
an < k < n. Furthermore on the event 03^ every kxk submatrix of A n for k > an contains 
at least (1 — 5)k ones. Thus on the event 23£j n by (|20p we have for any A; > an and any 
kxk submatrix B of A n 

V erB>q k V erB>q k [(l-5)k\\ > f ^ 1 ~ S ^ ) k k^ k , 



where B is the submatrix of A n having the same rows and columns as B in A n . Note that 
the last inequality above holds for n large enough by Stirling's approximation. 

Thus we only need to prove that the probabilities of the events 23 n U £ n are summable (since 
then they happen only finitely many times almost surely). To end this observe that the average 
number of Is in every row and column of A n is greater than n(l — rj), so for rj < a5 by standard 
large deviation arguments there exists a constant C such that P(23 n ) < Cne~ n ( aS ~' n ^ c , which 
is clearly summable. For £ n use union bound to obtain 



(22 , r„„„ 2 e ^ e e 

"»<ti+t 2 <» V 7 X ' an/2<k 1 <n V 7 l<k 2 <n v 7 



(2r/) fc S 



an/2<ki<n an/2<k\<n 



for n small enough. It is easy to check the last inequality by writing (l + ?y fcl ) n — 1 = 

rj kl Y^eZo (l + f] kl Y an d bounding each of the terms on the right hand side. To prove that the 
right hand side above is summable, observe that for ry = 77(a) sufficiently small the following 
inequalities hold for an/2 < k\ < n 



(2n) fcl/2 < (2n) Qn/4 < (1 - ^)^ a / 2 > < (1 - y/toj) 



n—k\ 
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Plugging this back into (j22j) we get 



ntn)<2 Yl ( fc n )(2r?) fci/2 (i-7^r-' 1 

an/2<k 1 <n ^ X ' 



The right hand side is just twice the probability that the sum of n independent random 
variables having value 1 with probability \/2~r\ and value otherwise, is greater than an/2. 
Choosing y/2rj < a/2, large deviation principle implies that this probability is exponentially 
small, and thus summable in n. This finishes the proof. □ 

Proof of Theorem 0. i) Taking an arbitrary 5 > by Lemma [12] we can find r > small 
enough so that almost surely for n large enough 

log per B logr 

—n — r- > ; — r + (1 - o), 
fc log k log 

for any an < k < n and any k x k submatrix B of ^4 n . Thus almost surely 

. logperS 
lim mi mm — — — > 1 — o. 

n B,k k log k 

Since 5 > was arbitrary the claim follows. 

ii) Let £ be parameter 1 Pareto distributed random variable. By Markov inequality, for all 
t > E(£) we have 

P(£ > t) < ^ = P(E(0l > 

Thus E(^)^ stochastically dominates £ from above and by Remark H] we can construct a 
sequence (-A n ) of random n x n matrices whose elements are independent and identically 
distributed as £, such that for any an < k < n and any k x k submatrix B of A n for the 
corresponding submatrix B of yl n we have peri? < E(£) fc peri?. Thus it is enough to prove 
the claim in the case when we replace £ with £. If B is an arbitrary k x k submatrix of A n 
then after permuting the rows and columns we can assume that B is at the intersection of 
the first k rows and columns. Denote by B c the matrix at the intersection of the other n — k 
rows and columns and observe that since A n has positive elements perA n > peri? per B c . 
Furthermore since all the elements are larger than 1 we have per£> c > (n — k)\ and thus 

peri? per^4 n log(n — fc)! ^ per A n (n — fc)(log(n — k) — 1) — c 



k log k k log k k log k k log k k log k 

for some c > 0, where the second inequality inequality follows from Stirling's formula (note 
that we can assume that k < n, since for k = n the upper bounds have been proven in the 
previous section). By the upper bounds in Theorem [JJ for any e > almost surely there is 
no such that for all n > no we have per^4 n < (1 + e)nlogn. For such n we have 

peri? . .nlogra (n — k) log(n — k) n — k c 
<(l + e) ^-r ~ 7— — - + i r + 



k log k k log k k log k k log k k log k 



(1 + e) log n -j\ _|_ n — ^ | c |_ n ( n ] \ l°g( n — ^) 



log n + log a J k log k k log k k \k J log k 
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For a fixed a the first three terms on the right hand side vanish in the limit and it suffices to 
show that 

fn fn \loe(n—k)\ 
23 limsup max - - - - 1 — ^ — _L < l. 

n-s-oo 2<k<n\k \k J \ogk J 

Denoting n = tk, where t > 1 we have 

n fn \ log(n — k) (t — 1) log(t — 1) 



log k log k 

When taking supremum one can assume that 1 < t < 2, that is k > n/2. In that case the 
claim follows from the fact that s \— > slogs is bounded on (0, 1). □ 

Proof of the lower bounds in Theorem^ The lower bounds for j3 < 1 follow from Theorem [2] 
i), so in the rest of the proof we will assume that /3 > 1. 

First define the random variable Y = (log £) / f3 and observe that 

Km '° gF(ra " = -1, 

t— >oo t 

and thus for any e > we have F(Y > t) > exp(— t(l + e)), for t large enough. Let {Yij) be 
an array of independent random variables distributed as Y, and for a fixed n define 

Qk = max Yfcj, for 1 < k < n. 

l<i<n-k+l 

Following the same arguments as in the proof of the first inequality in ([6j) one can prove that 
for any t' > 



"(Qi < t'logn) < exp( 



-n 



l-t'{l+e)\ 



holds for n large enough. In particular if we fix < t' < 1, e > such that t'(l + e) < 1 and 
< p < 1 then for n large enough 



< t') < ^ ex P ( — (n — + lf-t'W) 



V loefn — A; + 1) 

l<fe<pn v tov 7 l<k<pn 

(24) < pnexp(-(l - (0 ) 1 -* , ( 1 + e ) n 1 - t '( 1+e )). 

We will need this estimate later in the proof. 

Now we will run a greedy algorithm to extract a large submatrix of A n with a large 
permanent. Starting from the first row of A n we pick a largest possible admissible element in 
each row. First define A^ 1 ' = A n and mi as the smallest index between 1 and n, such that 
£i := £i mi = maxi<j< n ^i j j. Next we inductively construct the matrix A^ k+l ^ from A^ by 
deleting the first row and the m^-th column of the matrix A^ k \ At each step £fc is defined 
as the largest element in the first row of the matrix A^ and m/% as the first column which 
contains such an element. Note that conditioned on the elements of the first k — 1 rows of 
the matrix A n the matrix A^ is just an (n — k + 1) x (n — k + 1) matrix with independent 
elements distributed as £. This immediately implies that Cl> • • • j Cn are independent and that 
the first row in A^ k > is distributed as . . . £fc,n-fc+i)- Therefore 

(25) (!2|<l,...,!2|&)i (Ql ,... Qn) . 

Now fix some < t < t' < 1 and t/t' < p < 1. Then for n large enough 

nlogn ;/ 

26 t- -<t'(np-l). 

y ' logn + log(l-p) " v p 1 
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Let k = [pn\ and consider the (n — k) x (n — k) submatrix B = J 4( fc + 1 ) which is at the 
intersection of the last (n — k) rows and columns [n]\{mj : 1 < i < k}. Consider the 
complement submatrix B c which lies at the intersection of the first k rows and columns 
{nii : 1 < i < k). Since Q is the largest element in the i-th row of B c we have per B c > Yl i=1 Ci, 
and by (|25p we have that 



k k 

, „n ^/logperfT \ , \ ™/v^ Qi in log n 

27) P < t < PI > Qi < tnlogn < F > - — — -< B - - 

y ' \ finlogn ~ J ~ \^ J- V^log(n-i + l) ~ log((l-/o)n) 

Now using (|26|) we see that if the event under the probability on the right hand side happens, 
then Qi < if log(n — i + 1) happens for some 1 < i < k. Therefore (I24p and the sub-additivity 
imply that for any e > such that t'(l + e < 1 we have 

(28) P (if^! < t) < E p (w^Tn ^ * ^ ex P(-( x " p) 1 -*'^^'^), 

V pn log n / ^— ' \log(n — z + lj / 7 

for n large enough. Since the right hand side is summable in n we see that almost surely 
^raiogrf — fit happens for only finitely many integers n. 

On the other hand, Lemma [12] implies that for any 6 > there is r > such that almost 
surely 

per B > r n ~ k {n - fc)(l-«><»-*) > (r(l - p) 1 ' 8 ) (1_p)n n (i-«)(i-P)«. 

Since has positive elements, the inequality per A n > per i? per B c holds and thus almost 
surely for n large enough 

log per A n > p t+ , l _ «/j j + (l-p)(logr + (l-^) log(l - g) 
nlogn — ' logn 

Taking the limit as n — > oo and then t j 1 (which forces p f 1) yields the claim. 

□ 

5. Non-square matrices and the necessity of fl2J 

In this section we sketch how the above arguments extend to a large class of rectangular 
matrices. We will still assume that elements are sampled independently from a distribution 
supported on M + , but will now allow the width of the matrix to be significantly larger than 
the height, in particular it will suffice for the height to grow polynomially in the logarithm 
of the width. The precise condition under the method extends is that matrix A n is m n x n, 
that is has height m n and width n, and the height satisfies the condition 

m n log log n 
(log n) 2 

Observe that for an m x n matrix with iid elements of mean p we have E(per A n ) = (^)mlu m 
which demonstrates that the scaling function nlogn will have to be replaced by m n logn. 



(29) lim > f n2 2 T > L 



Theorem 13. Let £ be a positive random variable satisfying ([2]) for some /3 > 0. If (A n ) n 
is a sequence of m n x n matrices on a common probability space with elements which are 
independent and identically distributed as £ and satisfying (f29|) . then almost surely 

(30) hm =max(l,p). 

n-s-oo m n log n 
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The uniformity over all submatrices of linear size holds as well. 

Theorem 14. Assume that (A n ) is a sequence of m n x n matrices on a common probability 
space with elements which are independent and identically distributed as £ and satisfying (I29p . 
and let < a < 1. 

i) We have 

/ q -i \ ,. . f log per B 

(31) Iiminf mm — > 1, 

»woo (kiM,B) KilogA;2 

where the minimum is taken over all pairs of integers (fei,^) satisfying am n < k\ < 
m n , an < k2 < n and k\ < k2 and all k% x k<i submatrices B of A n . 

ii) If £ has a finite mean then 

(32) limsup max -r^T~ — ; — = 1> 

n^oo (k u k 2 ,B) fcl log fc 2 

where the minimum is taken over all pairs of integers (ki, fo) satisfying am n < k\ < 
m n , an <k2<n and k\ < k2 and all k\ x k2 submatrices B of A n . 

The proofs of these theorems are modifications of the arguments in the previous two sec- 
tions. We will briefly sketch how the modifications go and at which points one needs to do a 
more careful analysis. Note that, to simplify notation, we will drop the ceiling and the floor 
notation throughout the section. 

Sketch of the proof of the upper bounds in Theorem \1SX As before, by stochastic domination, 
it suffices to prove the claim when elements are Pareto distributed. To end this one needs 
to prove a version of Lemma [6] which states that when (Yi,j) are independent exponentially 
distributed with rate one and A > 1, we have 

E / log log n \ 
P max } Yi-u) > m n logn + m n < oo. 

n=2 \ i=l / 

Proceeding as in the proof of Lemma [6] one is left to show that 



n=2 



If 1 loglognx \ 

>^exp \m n < oo, 

^-^ \ v log n—1 A log n / 



for some A > 1. By (|29p for A > 1 small enough the expression in the exponent is not larger 
than — alogn for some a > 1 which yields the claim. 
Next one defines the analog of (jlpp as 



(33) 



J n,k 



< it G S mn , n : (k - l)m n < ^ Fj )7r(i ) < km n \ 
^ i=l ' 



and needs to prove (fTTI) . Calculating expectation of Z n ^ as in (fT2|) yields 

(34) ( U )e~ km "m™"(k m " - (k - l) m ") <E{Z nk ) < ( " J e'^™* (km r , 
\m n J \m n J 



14 tonCi antunovic 

For k = 1 this yields E(Z„ )X ) > (^Jmje"™" which handles the sum ^ n > 2 E (^n,i) 1_7 - We 
are left to prove the analog of (|16p . that 

1 e mn(7-l) „ e fcm n (7-l) 

^ ¥,{Z nk Yl- 1 ~ ( n VFIZ m n ( 7 -l) 2^ fc m„( 7 -l) 

is summable in n. Again by the convexity of g(t) = e trrin<a ~ l ^t~ rrin<y ' y ~ 1 ^ and the fact that 
5(1) < <?(logn + ^ log log n), proceeding as before one is left to prove that 



ne 



m ™ (7 x) log n + j log log n 



n \7 - 1 



mnflogn) 1 - 1 /^ J (n\ 

\m n ) 

is summable in n. Since (logn)~ Kmn is summable, for any k > 0, it is enough to show that 
(„? ) > (cn/m n ) mn , for some c > 0. Taking logs on both sides and using ([2T]) (we can assume 
that m n < n) it is enough to show that 

(n-m n + 1/2) (log n - log(n - m n )) > m n logc+ ^logm n , 

which holds for c < 1 and n large enough (since the left hand side is positive and the right 
hand side negative). This proves (fTTj) for (J33J) . 
To finish the proof assume that both 

S^v ^ 1 m n log logn 
max y, H,Tr(i) < m n log n H and 

1=1 

■^n,fc < E(Z„ fc) 7 , for each 1 < k < logn H - — , 

A 

hold for some A > 1, which is true for n large enough almost surely. The same calculations 
as in the proof of of the upper bounds in Theorem [TJ and (|34p yield 

7 



per,4< (™ y^-^ m "m^(logn)V m « e m "T'^i ogn + ^ loglogn y mn 



and 



log per A ^ _ log (JJ i Q _logm„ i /?-7loglogn i 7 i _ log (log +\ log logn) 



< 7 : — — + p - 7 + 7-j 1 : : V : h 7" 



rn n log n m n log n log n A log n log n log n 

The last three terms vanish in the limit and so it remains to prove 

r / lo S (mj . logm n \ 
hmsup — - — h — < 1. 

n->oo Vm n logn logn / 

After applying ()2ip we are left with 

hmsup (JL - (JL - A ^jn-mnh < L 
n \m n \m n J logn / 

which follows from (|23p . For /9 < 1 one can repeat the calculations, or simply refer to 
stochastic domination. □ 

The proof of Theorem [J3] is based on the following equivalent of Lemma [T21 



ON PERMANENTS OF RANDOM MATRICES WITH POSITIVE ELEMENTS 15 

Lemma 15. Let £ be a positive random variable and let A n be a sequence of m n x n matrices 
whose elements are independent and identically distributed as £ and which height m n satisfies 
()29p . For any < a < 1 and any 5 > there exists r > with the following property: Almost 
surely there exists no such that for any n > uq and any pair of integers {k\,k 2 ) satisfying 
am n < k\ < m n , an < k 2 < n, and k < k 2 , any k\ x k 2 submatrix B of A n satisfies 
peri? > r fcl /c2 1 ^ fel . 

Sketch of the proof of Lemma \15[ One follows the proof of Lemma [12j In the definitions one 
needs to write n for the width of the matrix and m n for the height, for example C n is defined 
as the event that for some pair of integers (fci,/^) satisfying 1 < fci < m n , 1 < k2 < n and 
ki + k 2 — an some k\ x k 2 submatrix of A n contains only zeros, and B n is defined as before. 
On 3^ H one has 

f q^[(l-S)k 2 \\, for h > {l-5)k 2 
[9 ([(l-^fcaj-fc!)!' ior /ci < l^l 0J/e 2 - 
In the first case the logarithm of the right hand side will be bounded from below by 
h logq+ (1 - 5)k 2 log((l - 5)k 2 ) - (1 - 5)k 2 > hilogq- 1 + log(l - 6)) + (1 - 5)h logk 2 , 
and in the second 

fcl log q + ((1 - <5)fc 2 + 1/2) log((l - 5)k 2 ) - ((1 - 5)k 2 - h + 1/2) log((l - 5)k 2 - h) + fci 

> fci log((l-5)fc 2 ) + fci(l + logg), 

which is sufficient in both cases. Probability of the event B n is estimated as before. For C n 
one can follow the arguments in (|22p starting with 

™<* e (:) e 

an/2<fc 2 <n l<fci<m n v 1 7 

This inequality follows from the simple fact that m n < n and fci < k 2 imply that (^ n ) (^j < 

(T;)(, n 2 )- 2 1 □ 

Sketch of the proof of Theorem \14\ Lower bounds follow from Lemma [151 For the upper 
bounds again use Markov's inequality and reduce to the case when elements of A n are pa- 
rameter 1 Pareto distributed. Similarly as before observe that for any fci x k 2 submatrix B, 
any term in the sum defining peri? can be expanded in { r ^~-ki)^ mn ~ ^l)' ways to a term in 
the sum defining per A n . Since all elements of A n are greater or equal than 1 we have 

log per B < log per A n - log ( ( U k ] ] (m n - fci)! 



m n - k X/ 

< log per A n - (n - fci + 1/2) log(n - fci) + (n - m n + 1/2) log(n - m n ) + (m n - fci). 

Terms m n — fci, log(n — m n ) and log(n — fci) are o(fci log k 2 ). After disregarding them, by the 
proven upper bounds in Theorem \13\ almost surely for any e > one has for n large enough 

log per B ^ logn / m n m n log n + (n — m n ) log(n — m n ) — (n — fci) log(n — fci) 



fci log k 2 log k 2 \ fci fci log n 

We are left to prove that 

(35) — (n — fci) log(n — fci) + (n — m n ) log(ra — m n ) + (m n — fci) logn < o(m n logn), 
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uniformly in am n < k\ < m n . Since the left hand side (as a function of k\) is increasing on 
(0,n(l — 1/e)) and decreasing on (n(l — 1/e), 1) it suffices to prove (l35j) for k\ = m n when 
m n < n(l — 1/e) and for ki = n(l — 1/e) when n(l — 1/e) < m n < n (assuming a < 1 — 1/e 
which is of no loss of generality). For k\ = m n the left hand side is so there is nothing to 
prove, and for k\ = n(l — 1/e) the left hand side is equal to 

ft 

(n- m n ) ( log n - log(n - m n )) , 

e 

which is not larger than an o(n log n) term, and this suffices in the case n(l — 1/e) < m n < 
n. □ 

Sketch of the proof of the lower bounds in Theorem EE The lower bounds for f3 < 1 case fol- 
low directly from Theorem [14] i) . For the case j3 > 1 one can follow the arguments almost 
verbatim. In (124D one needs to replace pn by pm n in the upper limit in the sum and in the 
factor in the end. Moreover the greedy algorithm is run for k = pm n steps, where p is defined 
as in ([26]) with n replaced by m n (except under logs) . One defines the submatrices B and B c 
as before and arrives at the analog of (|28[) 

togperiP A X p f_gj ;<t')< P m n eM-(l-p) 1 - t ' (1+e) n 1 - t ' {1+e) ). 

,/3m n logn- /- ■ Vlog(n - i i + 1) _ / ~ P v v P; ; 

To finish the proof apply Lemma [15] on B identically as Lemma [T2] in Section [H □ 

This example shows that Theorem [1] in general fails when the limit in ([2]) does not exist. 
Actually this is possible at arbitrary small oscillations of the sequence in ([2]) . We present the 
argument for square matrices. 

Example 16. Let S = {ki} be a set of positive integers labeled so that fcj+i > 2/cj. Fix 
C*2 > C\ > A > 1 and for every k > 1 define the following sequences of positive real numbers 

„h -/ / ! / f expf— A fc /Ci), for k 4 S 

t k = exp(A ), p k = exp(-A /d), and p fc = j ^J^,)^ for fc * & 

Clearly both series X^fcPfc an< ^ Sfc^fc converge, so we can normalize the sequences with the 
its sums Z and Z respectively and obtain sequences p k = p'^/Z and pk = p'^/Z. Let £ and 
£ be random variables supported on the set {tk} with distributions P(£ = t^) = pk and 
^(C = ^fc) = Pk- Observing that the mappings t \-t P(£ > t) and t h-> P(£ > t) are constant on 
(tk,tk+i] and that P(£ > tk) < 2pfc, for all A; G 5 large enough and for infinitely many k S 
as well, and P(£ > tk) < 2p/%, for all k large enough, it is easy to see that 

(36) hmmf = — — - < — — = hmsup , 

t^oo logi Ci/A 6 2 t^oo logt 

r . f logP(f >*) 1 1 V logP(l>t) 

(37) hmml = — — < — — = hmsup . 

t^oo logt Ci/A Ci f^oo logt 

As usual let (A n ) denote a sequence of n x n matrices on a common probability space with 
independent elements distributes as £. We will prove that 

(38) liminf — ^ < C%, and limsup — ^ = 

n nlogn n nlogn 

To get the upper bound on liminf take a sequence {t{) of positive integers such that 
ki < ti < ki + \ and that sequences (£i — ki) and (fcj+i — li) are strictly increasing. Define 
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integers rtj = exp(A^/Ci). By a simple union bound the probability that A n contains an 
element tj>, for some I > h l+ i is bounded from above by 

2n 2 iPki+1 = iexp(2A £ 7C 1 )2exp(-A^VC7 2 ), 

for i large enough. This expression is summable in i, so almost surely A ni does not contain 
elements greater than tk i+1 for i large enough. Thus to prove the first inequality in (j38|) one 
can assume that elements in A ni are distributed as £l(£<t fc . j)- Next observe that fl(£<t fc . x ) 

is stochastically dominated by that is 

While the inequality is trivial for t > tfc j+1 and for t < tfc j? for t^. < t < t^ i+1 it follows from 
the fact that for J C {tk i} tk i+1 ) 

P(£ G J) = ^ fc:tfcgJ ^ fc < ^k'-tk^jPk _ ^ ^ 

since p^. = p^, for tt £ J and Z < Z. Thus if A n is the sequence of n x n matrices whose 
elements are identical and distributed as £ then 

Hm . nf log per A w ^ Hm n t logt fcl +logperi na ^ ^ A*_ + = ^ 
n nlogn i rijlognj t \ li jC\ 

Here the second inequality follows form the upper bounds in Theorem Q] and (|37p . 

To prove the second relation in (j38[) fix G 5", e > and define the integer n = n^ = 
exp((l + e)A fc /C2). We proceed with a greedy algorithm analogous to the one in the proof 
of the lower bound in Theorem [TJ With the probability 1 — (1 — p^)™ there is an element in 
the first row of A^ = A n equal to On this event take the first such element, remove the 
corresponding column and the first row from A n and obtain the (n — 1) x (n — 1) matrix A^ 
which is independent of the first row and is distributed as A n ~\. Now repeat the step with 
^l^ 1 ) instead of A^ and proceed recursively as long as one is successful at each step. For 
< p < 1 one will not be able to proceed till step pn with probability at most 



(1 - PkY < — < 3Z exp ( - — P exp(eAVC 2 ) + A fc /C 2 

Pk \ Z 

(l—p)n<i<n 



if h is chosen large enough. The right hand side is clearly summable in h and thus almost 
surely for k large enough and n = constructed as above, the above algorithm will be 
successful for pn steps. In that case one can get lower bound on per^4 n as in the proof of 
the lower bounds in Theorem [1} The matrix at the intersection of the first pn rows and 
the removed columns is bounded from below by the product of extracted elements, that is 
t^ 1 and the matrix at the intersection of the last (1 — p)n rows and non-removed columns 
is bounded from below by ((1 — p)n)\ (since all of it's elements are greater or equal than 
1). Therefore for k large enough and n constructed as above per A n > — p)n)\. Since 

log((l — p)n)\/{n log n) — > 1 — p and 

fog 4 n > ifog^fc > pA fc _^ C 2 p 
nlogn ~ logn ~~ (1 + e)\ k /C2 1 + e' 
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we obtain that almost surely 



lim sup 



log per A 



■n 



> 



C 2 p 



+ l-p. 



n— >oo 



nlogn 



1 + e 



By sending p — > 1 and e — > we get C2 as the lower bound on the lim sup, and by Theorem 
[Hand (}3oT) it is equal to C2. 
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